Embedding similarity evaluation : English-Spanish

Level 0 : Basic Overview Level 1 : Retrieval Level 2 : Topic‑Level Level 3 : Error‑Type
Model Model Info Dist Chart Rand Chart Basic Stats CMC Curve Retrieval Stats UMAP (Topics & Lang1) UMAP (Topics & Lang2) UMAP (Sent‑Len) Topic Avg Cosine Error % Bar Chart
all_mpnet_base_v2
Statistic Value
arch mpnet
hidden_size 768
layers 12
vocab_size 30527
Statistic Value
mean_true 0.520555
std_true 0.223859
mean_random 0.152264
std_random 0.099044
snr 3.718436
ks_p_value 0.000000
Statistic Value
recall@k 0.483550
precision@k 0.096710
mean reciprocal rank 0.487856
all_MiniLM_L6_v2
Statistic Value
arch bert
hidden_size 384
layers 6
vocab_size 30522
Statistic Value
mean_true 0.463276
std_true 0.242676
mean_random 0.107584
std_random 0.091173
snr 3.901295
ks_p_value 0.000000
Statistic Value
recall@k 0.444600
precision@k 0.088920
mean reciprocal rank 0.455219
all_roberta_large_v1
Statistic Value
arch roberta
hidden_size 1024
layers 24
vocab_size 50265
Statistic Value
mean_true 0.602619
std_true 0.188348
mean_random 0.117541
std_random 0.096130
snr 5.046082
ks_p_value 0.000000
Statistic Value
recall@k 0.79290
precision@k 0.15858
mean reciprocal rank 0.74458
paraphrase_mpnet_base_v2
Statistic Value
arch mpnet
hidden_size 768
layers 12
vocab_size 30527
Statistic Value
mean_true 0.471597
std_true 0.233634
mean_random 0.163883
std_random 0.086582
snr 3.554028
ks_p_value 0.000000
Statistic Value
recall@k 0.358700
precision@k 0.071740
mean reciprocal rank 0.399813
paraphrase_MiniLM_L6_v2
Statistic Value
arch bert
hidden_size 384
layers 6
vocab_size 30522
Statistic Value
mean_true 0.423578
std_true 0.249959
mean_random 0.125499
std_random 0.097514
snr 3.056799
ks_p_value 0.000000
Statistic Value
recall@k 0.287600
precision@k 0.057520
mean reciprocal rank 0.352215
bert_base_nli_mean_tokens
Statistic Value
arch bert
hidden_size 768
layers 12
vocab_size 30522
Statistic Value
mean_true 0.598357
std_true 0.209622
mean_random 0.347858
std_random 0.131530
snr 1.904495
ks_p_value 0.000000
Statistic Value
recall@k 0.277100
precision@k 0.055420
mean reciprocal rank 0.352927
LaBSE
Statistic Value
arch bert
hidden_size 768
layers 12
vocab_size 501153
Statistic Value
mean_true 0.891042
std_true 0.084107
mean_random 0.186426
std_random 0.096587
snr 7.295161
ks_p_value 0.000000
Statistic Value
recall@k 0.987600
precision@k 0.197520
mean reciprocal rank 0.979883
distiluse_base_multilingual_cased_v2
Statistic Value
arch distilbert
hidden_size 768
layers 6
vocab_size 119547
Statistic Value
mean_true 0.898081
std_true 0.099776
mean_random 0.021226
std_random 0.081071
snr 10.815850
ks_p_value 0.000000
Statistic Value
recall@k 0.981950
precision@k 0.196390
mean reciprocal rank 0.972438
paraphrase_multilingual_MiniLM_L12_v2
Statistic Value
arch bert
hidden_size 384
layers 12
vocab_size 250037
Statistic Value
mean_true 0.890121
std_true 0.108192
mean_random 0.163694
std_random 0.130198
snr 5.579393
ks_p_value 0.000000
Statistic Value
recall@k 0.959050
precision@k 0.191810
mean reciprocal rank 0.947329
paraphrase_multilingual_mpnet_base_v2
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.918349
std_true 0.091276
mean_random 0.214925
std_random 0.124556
snr 5.647454
ks_p_value 0.000000
Statistic Value
recall@k 0.970000
precision@k 0.194000
mean reciprocal rank 0.959031
paraphrase_xlm_r_multilingual_v1
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.888385
std_true 0.099498
mean_random 0.183479
std_random 0.105966
snr 6.652215
ks_p_value 0.000000
Statistic Value
recall@k 0.976400
precision@k 0.195280
mean reciprocal rank 0.966757
xlm_r_distilroberta_base_paraphrase_v1
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.888385
std_true 0.099498
mean_random 0.183479
std_random 0.105966
snr 6.652215
ks_p_value 0.000000
Statistic Value
recall@k 0.976400
precision@k 0.195280
mean reciprocal rank 0.966757
stsb_xlm_r_multilingual
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.921425
std_true 0.091246
mean_random 0.176070
std_random 0.146825
snr 5.076479
ks_p_value 0.000000
Statistic Value
recall@k 0.969600
precision@k 0.193920
mean reciprocal rank 0.957799
xlm_r_bert_base_nli_stsb_mean_tokens
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.921425
std_true 0.091246
mean_random 0.176070
std_random 0.146825
snr 5.076479
ks_p_value 0.000000
Statistic Value
recall@k 0.969600
precision@k 0.193920
mean reciprocal rank 0.957799
xlm_r_100langs_bert_base_nli_stsb_mean_tokens
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.921425
std_true 0.091246
mean_random 0.176070
std_random 0.146825
snr 5.076479
ks_p_value 0.000000
Statistic Value
recall@k 0.969600
precision@k 0.193920
mean reciprocal rank 0.957799
xlm_r_100langs_bert_base_nli_mean_tokens
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.953912
std_true 0.060187
mean_random 0.346739
std_random 0.156871
snr 3.870531
ks_p_value 0.000000
Statistic Value
recall@k 0.962900
precision@k 0.192580
mean reciprocal rank 0.951571
distilbert_multilingual_nli_stsb_quora_ranking
Statistic Value
arch distilbert
hidden_size 768
layers 6
vocab_size 119547
Statistic Value
mean_true 0.970870
std_true 0.035256
mean_random 0.758981
std_random 0.071919
snr 2.946196
ks_p_value 0.000000
Statistic Value
recall@k 0.939850
precision@k 0.187970
mean reciprocal rank 0.923658
quora_distilbert_multilingual
Statistic Value
arch distilbert
hidden_size 768
layers 6
vocab_size 119547
Statistic Value
mean_true 0.970870
std_true 0.035256
mean_random 0.758981
std_random 0.071919
snr 2.946196
ks_p_value 0.000000
Statistic Value
recall@k 0.939850
precision@k 0.187970
mean reciprocal rank 0.923658
xlm_r_large_en_ko_nli_ststb
Statistic Value
arch xlm-roberta
hidden_size 1024
layers 24
vocab_size 250002
Statistic Value
mean_true 0.892495
std_true 0.117369
mean_random 0.182740
std_random 0.147920
snr 4.798225
ks_p_value 0.000000
Statistic Value
recall@k 0.936450
precision@k 0.187290
mean reciprocal rank 0.920795
xlm_r_base_en_ko_nli_ststb
Statistic Value
arch xlm-roberta
hidden_size 768
layers 12
vocab_size 250002
Statistic Value
mean_true 0.883468
std_true 0.105642
mean_random 0.220212
std_random 0.165127
snr 4.016651
ks_p_value 0.000000
Statistic Value
recall@k 0.930850
precision@k 0.186170
mean reciprocal rank 0.911277
clip_ViT_B_32_multilingual_v1
Statistic Value
arch distilbert
hidden_size 768
layers 6
vocab_size 119547
Statistic Value
mean_true 0.970577
std_true 0.025745
mean_random 0.826962
std_random 0.066664
snr 2.154303
ks_p_value 0.000000
Statistic Value
recall@k 0.766500
precision@k 0.153300
mean reciprocal rank 0.765431